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ABSTRACT 

speech synthesis and speech recognition systems offer 
access to communication and infoxitatlon for students with 
communication disabilities, thus eliminating ma:Jor historical 
barriers to learning for these students and allowing them to 
participate in the school environment. This guide describes two ways 
of producing speech synthesis: (1) by recording, analysis, storage, 
and play back of a human voice; and (2) by using a set of detailed 
pronunciation rules. Two types of speech recognition (isolated 
utterances and continuous speech) are also described. Examples of 
individuals with visual impairments, physical disabilities, and 
speech impairments illustrate applications of speech synthesis and 
speech recognition. The guide concludes with a list of communication 
aids and speech recognition systems, periodicals, organizational 
resources, and readings. (JDD) 
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Speech synthesis aod speech recogfiitkm systems are 
reshaping the hvea and edncaUon 3t«lents with 
dIsabfUUes. Speech ^thesis, also called syntheUc 
speesch. ackib a "teltdai^ dlmeimKm to |mxliM:ts smh as 
oanpnlera. caknilators, typewriters, watdies, and adding 
madimes. altowh^ ease mxsess to them by itfwIralB 
with visual bnpeJrme&te. dysfejda. and other disablUltes. 
The world of synthetic speech Is their access to 
communtcation and information, thus eliminating major, 
htstortcal harners to leamtng and altowlng them to 
partblpate to the school environment 

^U^Jte speech also serves as substitute speech tor 
nonvocal and nonverbal people. The Inability to spoak has 
oOsa limited the partSctpatton of sfmvErtial/QtmvDcal 
children In regular school prolans. Talking computere. 
partfcularly are a lechnotaglcal breaklhrouj^ 

that tnoreases the number oS siuiieato with dlsaMlltts 
who can be educated fuUy. Early speech products often 
had metallk: or echo-sounding voices ttet ixfM be 
dlfifeolt to understand. Many todays speedi prodmr^. 
however, have ncar-huroan-sounding votoe quality. The 
Improved quality has led to increased sales of speech 
products and a proUferaUon of the companies producing 
them. 

Speech Synthesis 

Synthette speech is produced in two ways: l)y analysis 
and by using rules. WHh synthesis by analysts, a human 
voice is recorded, analyzed, stored, and played back as 
needed. Before being Incorporated Into either software or 
hardware, thedi^tlzed vofce Is compressed: the computer 
then acts as a digitized OMnpater tape recorder. Entire 
words and phrases are stored In the computer's memory 
and then recalled on command. The h^»st quality 
speech output can be ach^vcd througb dtgttallmtkm at a 
sampling rate of 1 0.000 to 25,000 samples per second. 

With synthesis by rule, better known as text-toHspeech. 
the computer Is given a set of pronundatton rules to apply 
as it reads text. These rules specify sounds for 
components of words from phonemes, the most basic 
distinguishable sounds In language. The rules also specify 
how to combine these words to produce spoten wonte 
and phrases. Additional rules regulate pUch. determine 
which syllables are accented, and change Intonatton 
depending on wt^ther the srotence Is a qoestton or a 
statement. %nlhetlc speech by rule does not restrict 
vocabulary: The computer can synihesire any word by 
foUowtng the rules. However, because the sounds are 
arlificlalty produced, they often sound mechanical and 



rotmt-llke. Some text-to-speech products, such as 
Adaptive Communication Systems's Real Voice, have 
natural sounding s^ieech. 

Speech synthesisers are used to read aloud printed 
materials. For example, the Kurzwell Reading Madilne 
scans a page and reads It atoud. using sophisticated 
synthea^by-rute aj^gorithms to produced speech. Words 
are clearly enunciated and spoken with almost natural 
rhythm and inflection. Students with teaming dlsabUlUes 
who use this type of machine can be ffvcn textbooks 
augmented with speech. iMtructtons and screen text can 
be read aloud, reducing barriers created by poor nsdlng 
skil^. BUnd students can delect emm by listening to 
individual terters. spoken as they are typed. Abo. when 
ftnishai. the entire text can be read. Blind students can 
likewise edit computer pro^ammlng through speech 
syntheste code editors. 

Speech synthesis Is generally provided by hardware, 
either throu^ one or more microchips Instalted In the 
computer or by a separate peripheral with lis own power 
supply aod speaker that can be plugged Into a port on the 
computer. These producte can either be handheld, laptop, 
or the size of a desktop computer. The prices for products 
using speech synthesizers range from $300 for a 
synthesizer to more than $10,000 for the hardware. 
Software is less expeiwive than the hardware. 

Speech Synthesis Users 

The St. Lucy Day School of Philadelphia te a school for 
students with visual impairments. The 22 students 
ranging bom first to e^th grade have dally contact with 
a computer using arllfklal speech and receive at least 45 
minutes of Individual training weekly. The talking 
computers help keep the chUdren's attention focused on 
their subjects; in fact, the teachers have compared it to 
having a second teacher In the classroom. Sister Judith 
Ann MoeDcr. prtnclpal says. "The presence of speech 
output products has opened the workl of information for 
blind ttsers. The students have opportunities that they 
never had befive." 

Another exam;de ts Jenry PudUc of Farmlngton HlUs. 
Mk^^n. An e^tib-^rader at ^A^n^ Middle School Pud&k has 
a speedi laipaliiuaU as a result of being hit by a car . Sitting In 
his wheelchair, he be^ns answering questions using his 
RealVbfce. a lap&ip oammmdcadoos aid. "l have been usti^ 
speech oxmnunlcatkm akis fcr jeans. Ttey have made ft 
possible for me to a^xid school To speak to my SamHy and 
frtendalconU not exist terewfUioat II." says Pudtlk. 
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cBxxmA we a k^txKunL roll a aoeen up ikmn. extt a 
file, imxittce a spreadri^et m perfc^ i^d^ t^Gs. 
l\xiays speech im^cUt^ ^fatom atao 1^ 
pc^Uai tor the mmmcaVtoiimbal popolatton. After a 
ftey or two of traming» spea^^-finpalred d^dUktrra* ami 
adulter are i^b^ ^;Meech recpgiDdtfon sysi^GmB to ape^al 
sdiix^ ami In rehc^tattro tratntngp 

ThCTe are two types of speech recq^tttont teolated 
utteraaoes and conttnmms i^peech* ^c^ted wmk^ are 
stnj^ wimis or short phrmes, such as cmnptt^ 
oofflEimanfte* 

In an ^lated*utterance system such bb Dngpm 
Dictate* when a word to spoken, the microphone 
translates the word tnto an audio stgnal. The 
8pee<^-|»rocesslttg board omwrts the audio ^jsml 
tnto digital data that win be proccaaei by tte omipn^. 

As the ^)eech data (Aanu?t^^^ the w«ml enla:s tl^ 
comfHiter. a reco^ltlon prq^am ccmipares a to tte 
^xn^tlc word moctels rcsftiait In the ^cOvc vocalmlafy. 
Thcstattstfcal langiiaynwxielhdipssetertttenKwtM^ 
words socit to the dl^day. 

If the word spoken Is recc^Ized and correctly 
hltghll^ted In a displayed Choice Menu, the user 
ctmttnims tie dictation by spraldng the next wwd. which 
1$ then entered tnto tte text of apidto!attDii leg., wmd 
processor) and tte stattsttos fior t!^ wtxtd fa updated In 
the Ungual Moc^. Ifthe word spoken » nc^ highlighted 
but Is in tiie Clml!^ Menu, the uso* ran pu^ an 
ai^[Hoprlate functkm fe^ to select the wiHil. As tid^ 
word Is ii^ced tn the tort and the Langnagp Model Is 
updated. The lu^ou^ model in the Active Vocatmlary b 
also updated* 

If the WOTd taittot In tte Chotae ^fenu. the wcr either 
types* tn the H0mds4^ Chc^, t^peaks tlN? 
of the wwd. The Langoage 9todel sel»:te tte *flM»t 
Ukefy^wontelhsmtheAct^feVoc^ralary.Tl^^ 
any, are tt^c^ldfljvd in the Chc^Sifenu. Even with 1^ 
torgp VDcabufaries. a tyi^sl DragmDfctote user wfU see 
over 90% irf titeir ^pcdcen wmto f whfdi are in tte Active 
Vocabulary) on the first choice menu. 

€2o8(tihHsovu( ^^psaeii) 

For years peof^ Imve dreamed erf' Imvfi^ natural 
eonthiuous voice input tt»ir computers a{^pttcatkms. 
Continuous speech tB normal speech with multiple 
speakers and without pauses. O>nthiuous<apeech 
technology Is knowledge-tosed; it mak^ use of 
phcnmmes, tte ba^ ekmente of ^leech. Speech Sy^ems. 
Inc., for exam{^* Ims used phonemes to ctevek^ a system 
tliat recc^Qlres continuous speech* not cmfy wtmls tml 
entire sentimcesand fdurases. It i»wtcfcs iqpcakv models 
^te to recqgnire spe«A frmn a varl^ of ^N»l^^ 



The cqi^ates cm g^^al purpose ccrnismters. 

Its omipc»3»mto imilsvle a vi^ tofmt pert 
mgm am] g^mrto imie imi fonale speaker moitelsK 
runttnK recqputkm nmdule (|du»»tk! itecoderK and a 
devekvn-'to toc4ktt (ayntn complter. standard dtotkmary. 
amrikaittcm toilk^^ce imxlule, test aial ddn^ too^ ami 
p^'fi^nmnce mcamirmmmts toc^). 

Tbt techimtegy to matched to appttaaittoi^ such as 
trammgi^^is. didabsme TCce», cusrton^ service 
repmi^^^ms. A i^mtent fiM^exami^. can create ami 
review a rqpoit ami make CMTecthms frntmsdlBldty. 

Omse a uKidel to hmted. the iwr speaks tnto a 
mlcix^lrfmm ctevtoe cmuMC^ to a {dumelto csi^ne. Tl^ 
speedi Is conviated Into a string of pln»etto coites. If 
called fiar by an aj^dteatkm. i^cm^ codes may also 
ta^dttde Infommtkm cm {Mtdi. durnUtai. ami amfrfnmte. 

Tte i^onetto decoiler. a sdfhpare package 
language, runs m the host comj^ter, ft receives the 
phonetic cocte strings over an RS-232 serial toe from the 
phcmetic engine and converts them tnto word-string 
outfHit using the resources oS tim syntaai^. dlcttonartes, 
coartfctdatkm rules, ami speaker model £ach {rftonetfc 
string cotte CMtei^xnKte to a spcd^ uttorance and Is 
procfflsed as a unit 

Speech recognitlcm hardware units are needed to 
translate voloe commands to computer acttons. These 
units can be plug-ln circuit tKiards or external 
»eU^ntai]H»i units to toe with the com{mta'. Once 
Installed, ttey ipnerafl^ permit the user to operate off* 
Ihe-shdf software throt^ voice commands. Many units 
combine speech i^theste with speech reec^ttton. 

Speech Recognltton Usm 

Matthew Carbon lives in a Los Angetes sutairbu Five 
morning weekly, his lalha* Timothy places Matthew's 
speech recognttton sQ^tem in their van and transports 
lltetthew to a i»lvale schoc^ Matthew ki 1 4 years ami 
canmTt use his hands for writing, typing even turning 
th» pages oS a book* 

In ctass. Matthew sits erect tn hte wheetehalr and 
smiles coimtantfy. A mlO'oi^ne stts on the d^ near 
his computer. As the teacher spealm. Matthew speaks 
softly into hto microphone; his notes appear on the 
screen. Later he will review them for a test. Matthew 
a^ us^ hto compater to write his reports, do 
calculus and gecmietry fmibtoim. take tests, and mate 
tefefdtHme caSs. 

Matthew^ uncte Rtehard. a tether at the school, says, 
"Ifeinghls speech recq^ltton systrai. Matt can keep up 
with his dasmnatoa^ This sjnstem ami t^hers slml^ 
gjtm my nephew and other {mralyzed children the 
oppmrtimtty to atteml reguktr schools and gc» mi to cotl^ 
ami later to get a }otx** 
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"When I take a test, the l^cters let me take them iwlng 
my^ieech leco^Rton i^cm am^ber rooiiL it te^owe^ 
tban wrfUng, tnit stms I have ^vor wrttten a word in my 
Ufe wtlh has^. this speech feoc^ttao i^istem 
marvekMffi Im* mc." sajis Matthew. 

At home. Matthew^ craoipater Is hooked ap to another 
compu^ In about 10 minutes his fother. C^»t It ts 
ready, Matthew can makea telepkme call, turn ihe h^its, 
TV. and other appliances on and off. 

bi Chtea^. 12-yBat -c^ Mary Kes^ is 4»tlng htnch in 
the noisy schocri cafeteria with her frtends. Despite the 
noise she has a computer in front (tf her so she can use 
its i^ieech reco^tuon capsbtttUes. Ste kaA Issr arms m a 
1985 car accident and for two years, has been using a 
^leecfa recogpttkm intern In sdxxd and at him^. She 
caSs It t^kl Faithfur bei»ttse It has never let ha down. 

-I ase It for writing, making telephone calls, 
researching, m mathematics and science classes. In my 
ciMapata- prc^rammlng class. I am way ahemi of my 
fr^Mis because I <teveksp my own fM^igrBins." Aifary sa;^ 
gMul^. ^ learned to use the ^em in two days aiKl 
after three montto. comfi»1able eno^ with It to 
l»lng it to sdMxri. She keeps it there all the tfn^ stince she 
has another system at lumie. 

The futtue promises Improved speech and smaller and 
cheaper speech synthesis and rect^tlOB products. 
Some day there wiU be a handheld, lightweight 
aug^nentatlve coinmuntcatloss akl and a stmlter speech 
recognltton product devfce with much larger merooiy and 
stmage. 

Pfodiicts 

CwmwmHeatfcw Aids 

Adaptive Communicattott Systems. Inc.. 354 Hooksiown 
Grade Road. CUstcm. PA 15028: 412-264-2288. 

Artie Technotagles. 55 Park Street Suite 2. Troy. Ml 
48083-2753: 313-588-7370. 

Aubnnated Fuactkn». Inc.. 6424 North 28th Street. Ar- 
Unglon. VA 22307: 703-336-7741. 

Btazie En^eertng, 3660 Mill Green Road, Street. MD 
21028:301-879-4494. 

IXm Johnson Devdt^mwntal Equ^n^t. Inc.. P.O. Bcoc 
639. 1000 N. Rand Road. Budding 115. Wauconda. IL 
60064: 312-526-2682. 

Henter-Joyce. 7901 4lh Street. N., Suite 211. St. 
Petersbui* FL 33702; 613-576.5658. 

HumanWare. toe.. 6140 Horseshoe Bar Road. Suite P. 
Loomis. CA 95650: 916-652-7296. 

tamoccnnp. 3196 Wagon WhKi Drive, Sodon. OH 44691: 
216-262-1984. 

Kurzwell Computer Products. 185 Albany Street. 
Cambric^ MA 02139; 800-343-031 1. 



Luminaud. Inc.. 8888 Tyler BWd.. Mentor. OH 44060; 
216-255-9082. 

Prenlke-Romlch Co.. 1022 Heyl Road. Wooster. OH 
44691; 216-262-1964 or 800«42-6255. 

Saosrary Akis Oc^praratlon. Suite 122. 205 West Grand 
Avenne. BensenvUIe. IL 60106; 312-766-3935. 

Speech Enterprtses. P.O. Box 7986, Houston, TX 77270: 
713-46M006. 

TefesmsiHy Systena. faic., 455 North Bernardo Avenue. 
Mountabi View, CA 94039-7455: 4 1 5-960-0920. 

Venture Techimtagles. Inc., 304-134 Abbot Street. Van- 
couver. B.C. VeB 2K4. Canada: 604-684-9803. 

Speech Recr^ltkm Systems 

Artec Syslenw. Im:., 9104 Red Branch Road. Columbia. 
MD 2104a 

Bartiett Syatons. 1 1820 Parklawn Drive. Rocfcvttk:. MD 
20851. 301-231-930a 

Cherry Eli^trlcal S^tems. 3600 Sunset Avenue. 
Wauk^n. IL 60087. 312-360-3500. 

CfmsoUdated Systems Qrcwp. 6421 Hazclttme Bhrd.. 
Suite 159. Excelsior. MN 55331. 

Convox, Inc.. 675-D Conger Street. Eugene. OR 97402. 
503-342-1271. 

Dre^n Systen». 90 Bridge Street. Newton, MA 02158. 
617-965-5200. 

Hy-T^k. 1980 Rt. 30. Sugar Grove. IL 60554. 312-466- 
7664. 

bi^tate Vofce Products. 1849 West S«iuola Avenue. 
Orange. CA 9^68. 714-937-0910. 

Keytronic. P.O. Box 14667. Spokane. WA 99214. 

Kurzwell Al. 4 1 Waverly Oaks Road. Waltham. MA 02 1 54. 
617-893-5151. 

Mlcromlnt. Inc.. 4 Park Avenue. Vernon, CT. 06066. 203- 
871-6170. 

Microphonics Technology Corp.. 25-37 Street. NE. 
Auburn. WA 98002. 206-939-2321. 

MTI, Inc.. NE 29th Place. Suite 245. BeUevue. WA 98007. 
206^1-1789. 

NEC An^rlca. Inc.. 8 Old Sod Farm Road. MlhrlUe. NY 
11747. 516-753-7000. 

Scott Instruments Corp.. 1111 Wlltow Springs Drive. 
Denton. TX 78205. 817-387-9514. 

I^peech System. Im:.. 18356 Oxnard Street Tarzana. CA 
91356. 818-881-0885. 

Texas Iitttrumen^ Inc.. P.O. Box 2909, WS 2004. Aus- 
tin. TX. 512-250-7111. 

Tte Vt3$ai Cxmnedtaa, 17835 Skypark Qrcle. Suite C. 
IrvtncCA 92714. 714-261-2366. 
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Vok2e tmiimtr&s CtxrpJV&tlxx. 10 Madtaon Avenue* Mor- 
rison. NJ 07960. 201*^*7507. 

Vc^ Uamlng^ystema, 2265 Westwood Avenue, Suite 9, 
U»Angcte8,CA 90064* 714-^1*2366. 

Votan* 4487 Terhmricsgjr Drtve. Freemont CA 945^. 
413-490-7800. 

Ptssiodteuls 

ASHA, Amsr^n Speedi-Langjoage-Hewlng Association* 
10801 RDc&vlIle Pike* Itocfcvttfe. MD 208&2. 

Asa^tlue Tecftnofc^g^* the oflklal journal t3t RESNA. 
Demos Pulriteattnm. Im.. 156 Flltti Aynwt. Suite 
1016* New Ybrk. NY 10010. 

Aij^meniatflueomf Aitenuilfiie Ctor^ 

tton erf the Intematkmal Society fipf At^iCTtattve ami 
Altonattw Oraimunteattoa. Williams and W^dns* 
428 East Preston Strm. Bfldttnme* MD 2 1^. 

Ctosbig the Cap, P.O. Box GB. fteKfers(m« MN 1^044. 

CommmUcatkHt Outloak, Artificial Language Laboratory* 
Mtehl^ State Uatwrstty* 405 Computer Cento** 
East Lansing. MI 4^4-1042. 

SmMmaa 

AnHsrfcan Speech-Lai^S^ia^-lfear&^Anwxlat^ (ASHAI. 
10801 Rockvflle Pike. IbxrkvlJIe. MD 20^. 

AHtftctal Languid Ltfioratory. Mk^^ ^te Unlverst^* 
DqpartDient d Audkdpg^ ami ^leec^ Sdeooes* 405 Cora- 
putCT Center* Ewt Lm^ng Ml 48834*1042* 

IBM's Natkmal Suf^nrt Cteter fw Perscn^ with D&* 
abimies* P.O. Box 2150* Atlanta* OA 30055: 404*988* 
2733. 

Rehatiltttattcm EngM^eerlngCent^ m Ax^ffsmntBtlvc Com* 
munlcatkm, Untvor^U^ dTDdawarc* Departimnt ^rfCom* 
puter ami Informatkm Sconce* Newark. 1971 1. 

Ttie United States Society (or Augmentatlw and Attema* 
tive Communlcatjon* cjfo Barkley Memortel Center* 
Untverstty of Nebraska* Lbicoln. N£ 6858a 

Berliss* J. R,* Bimira^ P. A.* & Vieu«dlerl»^n« Q* C 
(19^ Trace resoimreboojc; AsstBttve tectmohgies 
for ammuntcatkm, contwL €uui am^mt^ €uxes$. 
Madron* WI: T)m:eRe8ean±amiDevek^m^tCen* 
ter. 

Burkhart L. J. (19871. IMng comfm^fs om< speech 
^ptthests to JacttUate ammimtcaUve tnteractkm 
u^th tfoijmg €atdA^ severely hamUcapped chlktren. 
Oolfege Park. MD: Author. 

Heisel*Dew^ V... 8( Van Der Meiracker. M» (1987). Hie 
tnteltl^bdtty of synthetic speedi to learning hand- 
tcaf^sed diUdren. Journal qf Special EducatUm 
Technoiogy. 9f IK 38*44. 



January 1990 



Kani»enbi^ P.* Marquardt T. R, & LaiBon. J, (1968). 
Speech tntdl^^t^ty of two mice cmtput communtea- 
tkm afcK Jmmnl qf CcmrmmbxiUon IHsonters, 
21(1)* 11-20, 

Lahrn* &A. (Bd.), ( 1 Techm^y u^th Um fticldei%ce 
BopvkMttons: flofripttig Access ioBd fa ^ ^ 
Rosm VA: TbeOenler iar %«4alBducs^ 

LaRlvkTC. C & »iert4om. J. (1986), Speech syntl^sas 
ami speech recc^itkm hy ndaocomputer. In Jerry 
L. Northern (Ecte.)* Tfw persi^ml computer Jar 
speecll* iof^uc^e* and fwartJ^ prqf^^onais. Bos- 
ton: Llttte&rown- 



This gukhe was developed by John M. WtlUams. 
Presktoit and Conmltant ei Intmiatk>nal Centers of 
Tpdimdci^ for Disalded Posons In Stertn^ \^r]^ta. 
J(dto lb ate the edtttu* and intbl^ter SAfltf^ 
ami Individual Needs Techw>tagyl» a nmnthfy consunw 
newsletter fi>r dlsabkid peopte. 



Tte tafivmatlcm to this Tlaeb Ilis ts tn the puUfc 
domain. Ite^ers are encouraged to Goj^ and idiare It but 
please credtt the Center for Special Education Technology. 
Please notfly the CSehter of laigc quantlQ^ dtelrtbutloM. 

TadiUseOi^kleson the Sc4towtng topics are avatlabte 
from the Center upmi request: 

Guide (or Te^t^fs 
Oukle Kn* Parrats 

T^nokagy for Work* Hon^ and Leisure 

Computer Access 

Selecting Software 

Speech Teehmrieg^ 

Preschoc^ Chikiren 

Hearing Impairments 

Qmpu^ and Wrltb^ 

Oo.apttters and Cooparatlve Leamli^ 

Vltaffil Impairaients 

Learning Disabled 

TelecommunlcatlDn Netwm-ks 

Aug^nratattw and Alternative Communk^atlon 

Mfid^Hamltcapped 

This material was devek^>ed by the Center for Special 
Educatkm Techi^ok^ umler Contract No. 300*87-0115 
with ti» Office of i^pectai EducaUon PtoffBsm, V.S. 
O^sartntCTt of EdueanML Tteomirat bowein^. does not 
neoasartfyreOecttbeposltKmor polk^ofOSEP/EDand 
no G^ik^ aackffsexnent <tf the matertal idKraM 
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